Audio Super Resolution


Audio super resolution is the process of enhancing the quality of audio signals by increasing the sampling rate or frequency.

HQ-SVC: Towards High-Quality Zero-Shot Singing Voice Conversion in Low-Resource Scenarios

Add code
Nov 15, 2025
Viaarxiv icon

UniverSR: Unified and Versatile Audio Super-Resolution via Vocoder-Free Flow Matching

Add code
Oct 01, 2025
Viaarxiv icon

Towards General Modality Translation with Contrastive and Predictive Latent Diffusion Bridge

Add code
Oct 23, 2025
Figure 1 for Towards General Modality Translation with Contrastive and Predictive Latent Diffusion Bridge
Figure 2 for Towards General Modality Translation with Contrastive and Predictive Latent Diffusion Bridge
Figure 3 for Towards General Modality Translation with Contrastive and Predictive Latent Diffusion Bridge
Figure 4 for Towards General Modality Translation with Contrastive and Predictive Latent Diffusion Bridge
Viaarxiv icon

Ambisonics Super-Resolution Using A Waveform-Domain Neural Network

Add code
Aug 01, 2025
Viaarxiv icon

Audio-Assisted Face Video Restoration with Temporal and Identity Complementary Learning

Add code
Aug 06, 2025
Figure 1 for Audio-Assisted Face Video Restoration with Temporal and Identity Complementary Learning
Figure 2 for Audio-Assisted Face Video Restoration with Temporal and Identity Complementary Learning
Figure 3 for Audio-Assisted Face Video Restoration with Temporal and Identity Complementary Learning
Figure 4 for Audio-Assisted Face Video Restoration with Temporal and Identity Complementary Learning
Viaarxiv icon

ClearerVoice-Studio: Bridging Advanced Speech Processing Research and Practical Deployment

Add code
Jun 24, 2025
Viaarxiv icon

FireRedTTS-1S: An Upgraded Streamable Foundation Text-to-Speech System

Add code
Mar 26, 2025
Figure 1 for FireRedTTS-1S: An Upgraded Streamable Foundation Text-to-Speech System
Figure 2 for FireRedTTS-1S: An Upgraded Streamable Foundation Text-to-Speech System
Figure 3 for FireRedTTS-1S: An Upgraded Streamable Foundation Text-to-Speech System
Viaarxiv icon

FlashSR: One-step Versatile Audio Super-resolution via Diffusion Distillation

Add code
Jan 18, 2025
Figure 1 for FlashSR: One-step Versatile Audio Super-resolution via Diffusion Distillation
Figure 2 for FlashSR: One-step Versatile Audio Super-resolution via Diffusion Distillation
Figure 3 for FlashSR: One-step Versatile Audio Super-resolution via Diffusion Distillation
Viaarxiv icon

FLowHigh: Towards Efficient and High-Quality Audio Super-Resolution with Single-Step Flow Matching

Add code
Jan 09, 2025
Figure 1 for FLowHigh: Towards Efficient and High-Quality Audio Super-Resolution with Single-Step Flow Matching
Figure 2 for FLowHigh: Towards Efficient and High-Quality Audio Super-Resolution with Single-Step Flow Matching
Figure 3 for FLowHigh: Towards Efficient and High-Quality Audio Super-Resolution with Single-Step Flow Matching
Figure 4 for FLowHigh: Towards Efficient and High-Quality Audio Super-Resolution with Single-Step Flow Matching
Viaarxiv icon

AEROMamba: An efficient architecture for audio super-resolution using generative adversarial networks and state space models

Add code
Nov 11, 2024
Figure 1 for AEROMamba: An efficient architecture for audio super-resolution using generative adversarial networks and state space models
Figure 2 for AEROMamba: An efficient architecture for audio super-resolution using generative adversarial networks and state space models
Figure 3 for AEROMamba: An efficient architecture for audio super-resolution using generative adversarial networks and state space models
Figure 4 for AEROMamba: An efficient architecture for audio super-resolution using generative adversarial networks and state space models
Viaarxiv icon